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Background of Invention 
Field of Invention 



C [0001] This invention relates to template creation from scanned input such as the 
f*l creation of a document template. 



Description of Related Art 

[0002] Production printing such as that of a publisher of mass media or literary works 
often utilizes what is known as a makeready station. A makeready station allows a 
user to add value to printing projects by modifying or correcting existing documents 
or by creating new documents composed of heterogeneous inputs. Examples of added 
value include the addition of page numbers, the addition of headers and/or footers, 
the addition of digital watermarks and/or merge items, and merging scanned images 
with electronic format documents, such as page description language (PDL) pages. 

[0003] Various products exist with makeready functionality. For example, DocuTech 1 35 
by Xerox includes applications such as Document Scan and MakeReady that enable 
such functionality. 

Summary of Invention 

[0004] The systems and methods of this invention provide automated creation of a 
template for a page of an existing document. 
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[0005] The systems and methods of this invention separately provide automated creation 
of a template for a page of an existing document from a scanned input. 

[0006] The systems and methods of this invention separately provide a template for 
generating a page that is visually coherent with the page or pages of an existing 
document. 

[0007] According to various exemplary embodiments of this invention, an automated 

document template creation method comprises: scanning an existing document that 
contains a visual design to input an image; decomposing the image into at least one 
object using at least one image processing technique; determining at least one design 
attribute of the at least one object; and synthesizing a document template containing 
the at least one design attribute. 

[0008] In various embodiments, decomposing the image into at least one object 
comprises using an image segmentation technique. In various embodiments, 
decomposing the image into at least one object comprises using an image 
classification technique. In various embodiments, decomposing the image into at least 
one object comprises using an optical character recognition technique. In various 
embodiments, decomposing the image into at least one object comprises 
decomposing the image into at least one of a text region, a pictorial, line art and a 
color theme. 

[0009] In various embodiments, determining at least one design attribute of the at least 
one object comprises using an optical character recognition technique. In various 
embodiments, determining at least one design attribute of the at least one object 
comprises determining a page layout. In various embodiments, determining at least 
one design attribute of the at least one object comprises determining font 
information. 

[0010] 

According to various exemplary embodiments of this invention, an automated 
document template creation system comprises: a scanner that scans an existing 
document containing a representative visual design to obtain an image; an image 
processor that decomposes the image into at least one object and determines at least 
one design attribute of the at least one object; and a template creator that synthesizes 



APP ID-10063220 



Page 2 of 13 



m 

W 



m 



a document template containing the at least one design attribute. In various 
embodiments, the system further comprises a data storage device that stores the 
document template. 

[001 1] These and other features and advantages of this invention are described in, or are 
apparent from, the following detailed description of various exemplary embodiments 
of the systems and methods according to this invention. 

Brief Description of Drawings 

[0012] Various exemplary embodiments of the systems and methods of this invention 
described in detail below, with reference to the attached drawing figures, in which: 

[001 3] Fig. 1 is a schematic block diagram of a makeready station according to an 
exemplary embodiment of this invention; and 

[0014] Fig. 2 is an exemplary flowchart illustrating an exemplary method according to 
this invention. 

Detailed Description 

[001 5] The systems and methods according to this invention allow a user, such as a 
scan /preparation user in production printing, to add and/or replace pages in an 
existing document that has a predetermined visual design theme. For example, the 
visual design theme may comprise text treatments, margins, background colors and 
the like. Many reasons exist for creating additional pages. For example, the user may 
add content to the existing document for added value. Also, one or more pages of the 
existing document may be missing or damaged. For example, physical pages may be 
lost or damaged during handling of the document and electronic pages may be 
corrupted during transmission or conversion of the document. 

[001 6] Typically, a document creator uses one or more document templates when 

creating a document with a particular visual design theme. In production printing, the 
document templates are not conveyed with the document in the production cycle. 
Thus, the templates are not available to the scan/ preparation user. 



[001 7] 



Without the templates that were used to create the original document, the 
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scan/preparation user must manually identify each element of the visual design theme 
and use each element when creating additional or replacement pages for the existing 
document. Such manual identification and replication of the predetermined visual 
design theme is labor-intensive and introduces human error. 

[001 8] The systems and methods according to this invention allow the scan /preparation 
user generate additional and/or replacement pages for an existing document that are ' 
visually coherent with the predetermined visual design theme of the document. 
According to the invention, the creation of additional and/or replacement pages is 
automated so that manual labor and human errors are reduced. In various 
embodiments of the invention, the automated page creation is provided by automated 
creation of a template for a page of the existing document. In various embodiments, 
the template is created from a scanned input. 

[001 9] In various embodiments according to this invention, one or more pages of the 
existing document that contain the predetermined visual design theme are scanned. 
For each scanned page, the scanned image is decomposed using image processing 
techniques into objects, such as, for example, text regions, zones for page layout, 
pictorials, line art, background patterns and the like. Attributes of or information 
about the objects, such as, for example, font type, font size, paragraph spacing, line 
spacing, justification and the like, are/is acquired using image processing techniques. 
Further, background color and the like may be determined. 

[0020] Once the attributes of the objects are identified, a template containing the 

attributes is synthesized. Once created, the template is saved in any desired template 
format for use with word processing software or the like. The template can then be 
used to create additional/replacement pages for the existing document that are 
visually coherent with the predetermined visual design theme of the document. 

[0021] | n various exemplary embodiments according to this invention, the image may be 
decomposed into objects using any known or hereafter developed image 
segmentation techniques. For example, see U.S. Patents Nos. 5,293,430 to Shiau et 
al., 5,767,978 to Revankar et al., 5,850,474 to Fan et al., 6,1 75,427 to Lehmbeck et 
al. and 6,1 81 ,829 to Clark et al., each of which is incorporated herein by reference in 
its entirety. Alternatively or additionally, the image may be decomposed into objects 
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using optical character recognition (OCR) software, such as Textbridge Pro 98 by Scan 
Soft or Acrobat Capture by Adobe Systems Incorporated. It should be understood that 
any known or hereafter develop software that is capable of decomposing an image 
according to this invention may be used. 

[0022] In various exemplary embodiments, optical character recognition (OCR) 
techniques may be used, in conjunction with other techniques, to acquire the 
attributes of or information about the objects, such as font type, font size, paragraph 
spacing, line spacing, justification and the like. For example, products noted above 
that convert scanned images into formatted electronic representations of the image 

In 

O typically provide this capability, including optical character recognition (OCR). Further, 

J| background color regions may be determined using background detection algorithms 

|jl within the color extended image processing of Chip Set 95. Similar techniques may 

also be used to acquire font colors and the like. 



in 



[0023] Fig. 1 shows a makeready station 1 00 according to an exemplary embodiment of 
the invention. As shown in Fig. 1, the makeready station 100 comprises a processor or 

computer 1 02, an image capture device or scanner 1 04, a data storage device 106 

Ml . 

Q a "d a printer or output device 108. The scanner 104, the data storage device 106 and 
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the printer 1 08 are in communication with the computer via a communication link 
101. 

[0024] The makeready station 1 00 can be implemented using a programmed general 
purpose computer. However, the makeready station 1 00 can also be implemented 
using a special purpose computer, a programmed microprocessor or micro-controller 
and peripheral integrated circuit elements, an ASIC or other integrated circuit, a digital 
signal processor, a hardwired electronic or logic circuit such as a discrete element 
circuit, a programmable logic device such as a PLD, PLA, FPGA or PAL, or the like. In 
general, any device, capable of implementing a finite state machine that is in turn 
capable of implementing the exemplary flowchart shown in Fig. 2 can be used to 
implement the makeready station 100. 

[0025] The communication link 101 shown in Fig. 1 can be any known or later developed 
device or system for connecting various devices to a computing device such as the 
computer 1 02, including a direct cable connection, a connection over a wide area 
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network or a local area network, a connection over an intranet, a connection over the 
Internet, or a connection over any other distributed processing network or system. In 
general, the communication link 101 can be any known or later developed connection 
system or structure usable to connect devices and facilitate communication. 

[0026] Further, it should be appreciated that the communication link 1 01 can be a wired 
or wireless link to a network. The network can be a local area network, a wide area 
network, an intranet, the Internet, or any other distributed processing and storage 
network. 

[0027] It should also be appreciated that the data storage device 1 06 can be any known 



or later developed device or system capable of storing electronic information. Further, 
while the exemplary makeready station 1 00 is shown with the scanner 1 04, it should 



inputing data of an existing document may be used. 

[0028] Fig. 2 is a flowchart illustrating an exemplary embodiment of an automated 

document template creation method according to this invention. Beginning in step 
SI 000, operation continues to step SI 1 00, where data of an existing document is 



scanning the existing document. Then, in step SI 200, the image is segmented into 
objects and each object is classified by location and type. 

[0029] After segmentation and classification, each object is processed to extract object 
information in step SI 300. Next, in step SI 400, a determination is made whether the 
extracted object information warrants definition of one or more subobjects. For 
example, a text region may be sub-classified into a region containing red 10 point 
Arial font and another region containing black 1 6 point Times Roman font. If so, 
operation continues to step SI 500 where one or more of the objects is partitioned 
into smaller objects or subobjects. Operation then returns to steps SI 200-S1 400 to 
process the subobjects. 




be appreciated that any known or later developed device or system capable of 




received. For example, image data of the existing document may be captured by 



[0030] 



If no definition of subobjects is warranted in step SI 400, operation jumps to 
SI 600, where the extracted object information is merged with the object location. 
Then, in step SI 700, the merged data is used to build a template for the existing 
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document. Once the template has been created, operation continues to step SI 800, 
where the process ends. 

[0031] While this invention has been described in conjunction with various exemplary 
embodiments, it is to be understood that many alternatives, modifications and 
variations would be apparent to those skilled in the art. Accordingly, Applicants intend 
to embrace all such alternatives, modifications and variations that follow in the spirit 
and scope of this invention. 

[0032] For example, it should be understood that the configuration of the makeready 
station is illustrative and not limiting. The image capture device may be 
geographically separated from the processor of the makeready station. As noted 
above, the image capture device may be connected to the processor via any means of 
data transmission or image data from the image capture device could be conveyed via 
removable media, such as a diskette. 
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